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Path sharing and predicate evaluation for high-performance XML filtering psu.edu [PDFl 

Y Diao, M Altinel, MJ Franklin, H Zhang, P ... - ACM Transactions on 2003 - portal.acm.org 
... highly skewed, with a small number of very long candidate lists that do not provide much selectivity. ... 
As experi- mental results in Altinel and Franklin [2000] show, this additional cost is far outweighed ... 
a document arrives at the filtering en- gine, it is run through an XML Parser that ... 
Cited by 273 - Related articles - BL Direct - All 25 versions 



Efficient processing of XML twig gueries with QR-predicates psu.edu [PDF] 

H Jiang, H Lu, W Wang - Proceedings of the 2004 ACM SIGMOD .., 2004 - portal.acm.org 
... indexes can significantly improve the performance for matching twig queries with OR-predicates, 
especially when ... can evaluate query Q2 as two separate AND-twigs: /dblp/paper[title='XML']//author 
/dblp ... we may scan same data multiple times, incurring more I/O and CPU cost. ... 
Cited by 78 - Related articles - All 17 versions 



[PDFi Cost-based optimization in DB2 XML psu.edu [PDF] 

A Balmin, T Eliaz, J Hornibrook, L Lim, GM Lohman, ... - IBM Systems .., 2006 - Citeseer 

... that might satisfy those predicates; that is, nodes satisfying the individual predicates must descend ... 

Fanout is used in conjunction with the traditional notion of selectivity in determining the ... Statistics 

collection" describes the set of statistics used to make XML cost and cardinality ... 
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Benefits of path summaries in an XML query optimizer supporting multiple access methods 
A Barta, MP Consens, AO Mendelzon - Proceedings of the 31st 2005 - portal.acm.org 
... The advantage of using this strategy occurs in the case of selective predicates on one of the ... 
However, in the XML query c summaries can be used not only existing schemas for a ... The latter 
optimization str data statistics and (simple) cost-based h order to compute an efficient plan ... 
Cited by 31 - Related articles - BL Direct - All 4 versions 



Approximate XML query answers psu.edu [PDF] 

N Polyzotis, M Garofalakis, Y loannidis - Proceedings of the 2004 2004 - portal.acm.org 
... Generating approximate answers is a cost-effective solution for offsetting the high evaluation 
cost of XML queries. ... Note that the problem of effi- cient XML summarization also arises in the 
context of selectivity estimation, where the synopsis is only used to estimate the size ... 
Cited by 89 - Related articles - All 25 versions 
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Cost modeling and estimation for OLAP-XML federations 

D Pedersen, K Riis, T Pedersen - Data Warehousing and Knowledge 2002 - Springer 

... The query evaluation time will depend eg on the aggregation level and selectivity of any selections 

in a query. ... The cost formula distinguishes between two types of XML query results: Those that 

have been inlined in some predicate and those that have not been inlined in ... 
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FleXPath: flexible structure and full-text querying for XML 

S Amer-Yahia, LVS Lakshmanan, S ... - Proceedings of the 2004 2004 - portal.acm.org 
... Q1 by relax- ing the contains($3, "XML" and "streaming") to contains($2, "XML" and "streaming"). ... 
We could replace value-based pred- icates, eg, $i.price < 98 with $i.price < 100. We could also 
relax the contains predicate by making use of thesauri and replacing keywords with ... 
Cited by 1 63 - Related articles - All 1 9 versions 

Query processing for high-volume XML message brokering 

Y Diao, M Franklin - ... of the 29th international conference on Very 2003 - portal.acm.org 

... the extension of these paths does not add signifi- cantly to their processing cost. ... section> <section> 

<figure> <title> XML processing </title> </figure> </section> <figure> <title> XML process- ing ... 

title> </figure> </section>" In this case, the stream for the predicate path "//section ... 

Cited by 90 - Related articles - BL Direct - All 25 versions 

StatiX: making XML count 

J Freire, JR Haritsa, M Ramanath, P Roy, ... - Proceedings of the 2002 - portal.acm.org 
... Histograms are attractive because they are simple to implement, have been well- studied, and 
proven to be effective for selectivity estimation [18, 19]. ... Figure 5: Transformed Schema and 
Associated Statistics discuss in Section 4, in cost-based XML storage design, transfer ... 
Cited by 1 34 - Related articles - All 1 2 versions 

Design and evaluation of alternative selection placement strategies in optimizing continuous gueries 

J Chen, DJ DeWitt, JF Naughton - icde, 2002 - computer.org 

... R,S XML files AR, AS delta files for R, S |R|, |S| number of pages in R, S, assuming |R| = |S| 300 
||R||, ||S ... Cc CPU cost of evaluating a predicate (ms) 0.36ms ... y i the selectivity of a selection predicate 
7 u selectivity of the union of all selection predicates in a selection signature 0.8 ... 
Cited by 97 - Related articles - BL Direct - All 7 versions 
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Path sharing and predicate evaluation for high-performance XML filtering psu.edu [PDF] 

Y Diao, M Altinel, MJ Franklin, H Zhang, P ... - ACM Transactions on 2003 - portal.acm.org 
... highly skewed, with a small number of very long candidate lists that do not provide much selectivity. ... 
As experi- mental results in Altinel and Franklin [2000] show, this additional cost is far outweighed ... 
a document arrives at the filtering en- gine, it is run through an XML Parser that ... 
Cited by 273 - Related articles - BL Direct - All 25 versions 

Benefits of path summaries in an XML query optimizer supporting multiple access methods 

A Barta, MP Consens, AO Mendelzon - Proceedings of the 31st 2005 - portal.acm.org 
... The advantage of using this strategy occurs in the case of selective predicates on one ... follows: 
using path summaries, simple sta simple cost-based heuristics we can achieve s the ... DataGuides 
[14] (the path summ in Lore) are essentially the same struct considering XML trees). ... 
Cited by 31 - Related articles - BL Direct - All 4 versions 

Bloom histogram: Path selectivity estimation for xml data with updates 
W Wang, H Jiang, H Lu, JX Yu - ... on Very large data bases-Volume 2004 - portal.acm.org 
... A bad es- timate may mislead the query optimizer to choose a bad plan, whose cost could be 
orders of magni- tude higher. ... We present the basic structure of a bloom histogram, and the 
algorithms to estimate the selectivity of XML path expressions using bloom histograms. ... 
Cited by 38 - Related articles - BL Direct - All 20 versions 

Efficient processing of XML twig queries with QR-predicates 

H Jiang, H Lu, W Wang - Proceedings of the 2004 ACM SIGMOD .., 2004 - portal.acm.org 
... decomposition-based approach has a serious disadvantage: we may scan same data multiple 
times, incurring more I/O and CPU cost. ... n3: /title n4: /year n5: //author 'XML' 2003 ... Definition 2 
[OR-predicate] Given a query tree Q, an OR-predicate is a subtree in Q such that the root ... 
Cited by 78 - Related articles - All 17 versions 

Statistical learning techniques for costing XML queries 

N Zhang, PJ Haas, V Josifovski, GM ... - Proceedings of the 2005 - portal.acm.org 
... This behavior makes the cost modeling of XNav highly challenging. ... An XML element matches 
a parse-tree node if (1 ) the element name matches the node label, (2) the element value satisfies 
the value constraints if the node is also a predicate tree node, and (3) the element ... 
Cited by 37 - Related articles - BL Direct - All 13 versions 
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StatiX: making XML count 

J Freire, JR Haritsa, M Ramanath, P Roy, ... - Proceedings of the 2002 - portal.acm.org 
... Histograms are attractive because they are simple to implement, have been well- studied, and 
proven to be effective for selectivity estimation [18, 19]. ... Figure 5: Transformed Schema and 
Associated Statistics discuss in Section 4, in cost-based XML storage design, transfer ... 
Cited by 1 34 - Related articles - All 1 2 versions 

Approximate XML query answers 

N Polyzotis, M Garofalakis, Y loannidis - Proceedings of the 2004 2004 - portal.acm.org 
... answers is a cost-effective solution for offsetting the high evaluation cost of XML ... proposed 
summarization techniques: they enable more accurate approximate answers and selectivity 
estimates, and at ... qO being a distinguished root node always bound to the XML document root ... 
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[PDF] Cost-based optimization in DB2 XML 

A Balmin, T Eliaz, J Hornibrook, L Lim, GM Lohman, ... - IBM Systems 2006 - Citeseer 

... that might satisfy those predicates; that is, nodes satisfying the individual predicates must descend ... 

Fanout is used in conjunction with the traditional notion of selectivity in determining the ... Statistics 

collection" describes the set of statistics used to make XML cost and cardinality ... 
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Tree pattern aggregation for scalable XML data dissemination 

CY Chan, W Fan, P Felber, M Garofalakis, ... - Proceedings of the 2002 - portal.acm.org 
... and LDAP applications [3], Compared to earlier work based on attribute/predicate-based subscrip ... 
efficient method that exploits coarse statistics on the underlying distribution of XML documents 
to ... ically, our scheme employs the document statistics to estimate the selectivity of a ... 
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Query caching and view selection for XML databases 

B Mandhani, D Suciu - ... of the 31st international conference on Very 2005 - portal.acm.org 
... The selectivity of our SQL query depends a lot on this choice. ... of locality, the test workload will 
have overlap with S. We now want to insert views so as to be able to answer the queries in S. 
The main cost in warm-up is that of executing queries on the XML database to bring ... 
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